Comparing Implementation Features of Map Reduce in RDBMS with Distributed Cluster
Abstract
Data processing techniques are becoming more innovative as the amount of data grows. Here we explore two such techniques for processing big data: the traditional RDBMS approach and the distributed approach. We found certain advantages and disadvantages in both approaches. RDBMS is a widely used data processing technology across organizations, and replacing it with a new technology poses many challenges. Distributed processing is the need of the hour, and technologies like Hadoop, Map Reduce, etc. [1] are being...
Related resources
Distributed Parameter Map-Reduce
This paper describes how to convert a machine learning problem into a series of map-reduce tasks. We study the logistic regression algorithm. In logistic regression, it is assumed that samples are independent and that each sample is assigned a probability. Parameters are obtained by maximizing the product of all sample probabilities. Rapid expansion of training samples brings challenges to mach...
Effects of modified atmosphere packaging on some quality attributes of a ready-to-eat salmon sushi
Sushi, a very popular food worldwide, has become a popular ready-to-eat food sold in supermarkets, but it exhibits distinct features that are associated with microbiological hazards. Therefore, MAP technology, known to reduce aerobic bacteria in fishery products, was used in this study to improve the quality of ready-to-eat salmon sushi. Salmon sushi were packaged with air (control), 50% N2/50% CO2 (...
Improving Map Reduce Performance in Heterogeneous Distributed System using HDFS Environment-A Review
Hadoop is a Java-based programming framework that supports storing and processing big data in a distributed computing environment. It uses HDFS to store data and Map Reduce to process that data. Map Reduce has become an important distributed processing model for large-scale data-intensive applications such as data mining and web indexing. Map Reduce is widely used for short jo...
Analysing Distributed Big Data through Hadoop Map Reduce
This term paper focuses on how big data is analysed in a distributed environment through Hadoop Map Reduce. Big Data is the same as “small data” but bigger in size. Thus, it is approached in different ways. Storing Big Data requires analysing the characteristics of the data. It can be processed by employing Hadoop Map Reduce. Map Reduce is a programming model working in parallel for large c...
Publication date: 2017